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Abstract 

N-fold integer programming is a fundamental problem with a variety of 
natural applications in operations research and statistics. Moreover, it is uni- 
versal and provides a new, variable-dimension, parametrization of all of integer 
programming. The fastest algorithm for n-fold integer programming predat- 
ing the present article runs in time O (n^^^^L^ with L the binary length of 
the numerical part of the input and g{A) the so-called Graver complexity 
of the bimatrix A defining the system. In this article we provide a drastic 
improvement and establish an algorithm which runs in time O (n^L) having 
cubic dependency on n regardless of the bimatrix A. Our algorithm can be 
extended to separable convex piecewise affine objectives as well, and also to 
systems defined by bimatrices with variable entries. Moreover, it can be used 
to define a hierarchy of approximations for any integer programming problem. 

1 Introduction 

N-fold integer programming is the following problem in variable dimension nt, 

min {wx : A^^^x = b , 1 < x < u , x G Z"*} , (1) 

where 
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is an (r + ns) x nt matrix which is the n-fold product of a fixed (r, s) xt bimatrix A, 
that is, of a matrix A consisting of two blocks Ai, A2, with Ai its r x t sub matrix 
consisting of the first r rows and A2 its s x t submatrix consisting of the last s rows. 

*Technische Universitat Munich, Germany. 

^Technion - Israel Institute of Technology. Supported in part by the Israel Science Foundation. 
■fTechnion - Israel Institute of Technology. Supported in part by the Technion Graduate School. 



1 



2 



This is a fundamental problem with a variety of natural applications in operations 
research and statistics which, along with extensions and variations, include multiin- 
dex and multicommodity transportation problems, privacy and disclosure control in 
statistical databases, and stochastic integer programming. We briefly discuss some 
of these applications in Section 15) ( Corollaries 15 . 1 1 and 15 . 2 p . For more information see 
e.g. [ll[5l[g|9l[IOl[IIl[T2l[Tl[T5l[20],[I71 Chapters 4,5], and the references therein. 

Moreover, n-fold integer programming is universal [6] and provides a new, variable- 
dimension, parametrization of all of integer programming: every program is an n-fold 
program for some m over the bimatrix A := A{m) with first block Ai the 3m x 3m 
identity matrix and second block A2 the (3 + m) x 3m incidence matrix of the 
complete bipartite graph K^ ^n- We make further discussion of this in Section [71 

The fastest algorithm for ra-fold integer programming predating the present arti- 
cle is in [To] and runs in time O {n^^^^L^ with L = (w, b, 1, u) the binary length of the 
numerical part of the input, and g{A) the so-called Graver complexity oi the bimatrix 
A. Unfortunately, the Graver complexity is typically very large [21 [18]: for instance, 
the bimatrices A[m) mentioned above have Graver complexity g{A{m)) = fi(2"*), 
yielding polynomial but very large n*^^^™^ dependency of the running time on n. 

In this article we provide a drastic improvement and establish an algorithm which 
runs in time O (n^L) having the cubic dependency on n which is alluded to in the 
title, regardless of the fixed bimatrix A. So the Graver complexity g{A) now drops 
down from the exponent of n to the constant multiplying n^. This is established in 
Section [3l (Theorem I3.9P . Moreover, our construction can be used to define a natural 
hierarchy of approximations for ([T]) for the bimatrices A{m) with m variable, and 
therefore, by the universality theorem of [6], for any integer programming problem. 
These approximations are currently under study, implementation and testing, and 
will be discussed briefly in Section [71 and in more detail elsewhere. 

Our algorithm extends, moreover, for certain nonlinear objective functions: using 
results of [16] on certain optimality criteria, we provide in Section [H an optimality 
certification procedure for separable convex objectives whose time complexity is lin- 
ear in n (Theorem l4.ip and an algorithm for solving problems with separable convex 
piecewise affine objectives whose time complexity is again cubic in n (Theorem 14. 2p . 
Furthermore, the algorithm also leads to the first polynomial time solution of n-fold 
integer programming problems over bimatrices with variable entries (Theorem 16. ip . 

2 Notation and preliminaries 

We start with some notation and review of some preliminaries on Graver bases and 
n-fold integer programming that we need later on. See the book [17] for more details. 

Graver bases were introduced in [8] as optimality certificates for integer program- 
ming. Define a partial order □ on H." by x C y if Xii/i > and \xi\ < \yi\ for all i. 
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So C extends the coordinate-wise partial order < on the nonnegative orthant M" to 
all of M". By a classical lemma of Gordan, every subset Z C Z" has finitely- many 
□-minimal elements, that is, x G Z such that no other y G Z satisfies y ^ x. 

We have the following fundamental definition from [8]. 

Definition 2.1 The Graver basis of an integer m x n matrix A is defined to be the 
finite set Q{A) C of C-minimal elements in {x G Z" : Ax = 0, x 7^ 0}. 

For instance, the Graver basis of the matrix A := {1 2 1) consists of 8 vectors, 
g{A) = ±{(2 -1 0), (0 -1 2), (1 -1), (1 -1 1)} . 

Consider the general integer programming problem in standard form, 

min {wx : Ax = b , 1 < x < u , x G Z"} . (3) 

A feasible step for feasible point x in (jS]) is any vector v such that x -|- v is also 
feasible, that is, A\ = so A(x + v) = b, and 1 < x + v < u. An augmenting step 
for X is a feasible step v such that x + v is better, that is, wv < so w(x + v) < wx. 

Graver has shown that a feasible point x in (jS]) is optimal if and only if there is 
no element g G g{A) in the Graver basis of A which is an augmenting step for x. 

This suggests the following simple augmentation scheme: start from any feasible 
point in ([3]) and iteratively augment it to an optimal solution using Graver aug- 
menting steps g G g{A) as long as possible. While the number of iterations in this 
simple scheme as is may be exponential, it was recently shown in [10] that if in each 
iteration the best possible augmenting step of the form 7g with 7 positive integer 
and g G g{A) is taken, then the number of iterations does become polynomial. 
In what follows, we call an augmenting step which is at least as good as the best 
possible augmenting step 7g with g G g{A), a Graver-best augmenting step. 

It was shown in [5] that for fixed bimatrix A, the Graver basis Q (A^"^) of the 
n-fold product of A can be computed in time polynomial in n. Thus, to find a 
Graver-best augmenting step of the form 7g for an n-fold integer program ([1]), it is 
possible, as shown in [TU], to check each element g E G (A*-"-*), and for each, find the 
best possible step size 7. However, as we explain below, the Graver basis Q (A*^"-') is 
very large. Therefore, in this article, we do it the other way around. For each of 0{n) 
critical positive integer potential step sizes 7, we determine an augmenting step 7h 
which is at least as good as the best possible augmenting step 7g with g E Q (A*^")). 
We then show that the best among these steps over all such 7 is a Graver-best step. 

In preparation for this, we need to review some material on Graver bases of re- 
fold products. Let A be a fixed integer (r, s) x t bimatrix. For any n we write each 
vector X G Z"* as a tuple x = (x^, . . . ,x") of n bricks x* G Z*. It has been shown 
in [1], [18], and [13], in increasing generality, that for every bimatrix A, the number 
of nonzero bricks appearing in any element in the Graver basis Q (A^"^) for any n is 
bounded by a constant independent of n. So we can make the following definition. 
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Definition 2.2 The Graver complexity of an integer bimatrix A is defined to be tlie 
largest number g{A) of nonzero bricks g* in any element g G ^ (A*^'"^) for any n. 

This was used in [5] to show that the Graver basis Q^A^"^^) has a polynomial 
number 0{n^^'^'>) of elements and is computable in time 0{n^^^'^) polynomial in n. 
Thus, the computation of a Graver-best augmenting step in [10] was done by finding 
the best step size 7 for each of these 0{n^'^'^^) elements of Q{A^'^^), resulting in 
polynomial but very large ©(n^*-"^^) dependency of the running time on n. In Section 
Owe show how to find a Graver-best step without constructing Q{A^'^^) explicitly in 
quadratic time O(n^) regardless of the bimatrix A and its Graver complexity. 

We conclude this subsection with some remarks about complexity and finiteness. 
The binary length of an integer number z is the number of bits in its binary encoding, 
which is 0(log l^l), and is denoted by {z). The binary length (z) of an integer vector 
z is the sum of binary lengths of its entries. We denote by L the binary length of all 
numerical part of the input. In particular L = (w, b, 1, u) for problem ([1]). All num- 
bers manipulated by our algorithms remain polynomial in the binary length of the 
input and our algorithms are polynomial time in the Turing machine model. But we 
are mostly interested in the number of arithmetic operations performed (additions, 
multiplications, divisions, comparisons), so time in our complexity statements is the 
number of such operations as in the real arithmetic model of computation. 

For simplicity of presentation we assume throughout that all entries of the bounds 
1, u in program ([T]) are finite and hence the set of feasible points in ([T]) is finite. This 
is no loss of generality since, as is well known, it is always possible to add suitable 
polynomial upper and lower bounds without excluding some optimal solution if any. 

3 The algorithm 

We now show how to decide if a given feasible point x = (x^, . . . , x") in ([1]) is optimal 
in linear time 0{n), and if not, determine a Graver-best step 7g for x in quadratic 
time 0(?T,^). This is then incorporated into an iterative algorithm for solving ([T]). 

We begin with a lemma about elements of Graver bases of ra-fold products. 

Lemma 3.1 Let A be integer (r, s) x t bimatrix with Graver complexity g{A). Let 
Z{A) := |z G Z* : z is the sum of at most g (A) elements 0/^(^2)} . (4) 
Then for any n, any g G ^ (A*^"^) and any I C {1, . . . ,77,}, we have Xlie/S* ^ ^(^)- 

Proof. Consider any Graver basis element g G ^ (A*^'")) for some n. Then A^'^^g = 
and hence ^^=1 ^ig* = and A2g* = for all i. Therefore (see [171 Chapter 4]) 
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each g* can be written as the sum g* = XljLi some elements h*'-' G ^(^2) for 

all Let m := ki + ■ ■ ■ + kn and let h be the vector 

h := (hi'\...,h^''=i,...,h"'\...,h"''=") e Z"^* . 

Then Y.ij^^'^''^ = and A2h^'^ = for all i,j and hence A^^^h = 0. We claim 
that moreover, h G ^ (A*-™)). Suppose indirectly this is not the case. Then there is 
an h e ^ (.4(™)) with h C h. But then the vector g G Z"* defined by g^ := Xl^Li h''^ 
for all i satisfies g C g contradicting g G ^ (yl^")). This proves the claim. Therefore, 
by Definition 12.21 of Graver complexity, the number of nonzero bricks h^'-' of h is at 
most g{A). So for every / C {1, . . . , n}, we have that S* = Xlie/ Sj=i ^ 
sum of at most g{A) nonzero elements h*'-' G Q{A2) and hence ^ ^(^)- I— ' 

Example 3.2 Let A := A{3) be the (9, 6) x 9 bimatrix mentioned in the introduc- 
tion, which arises in the universality of ra-fold integer programming discussed further 
in Section [TJ having first block Ai = Ig the 9x9 identity matrix and second block 
the following 6x9 incidence matrix of the complete bipartite graph -^^3^3, 

/lOOlOOlOOX 
1 1 1 
001001001 
^ ~ 111000000 
1 1 1 

yooooooiii/ 

Since A2 is totally unimodular, its Graver basis Q{A2) consists of the 30 vectors in 
{0, ±1}^ supported on circuits of ^^3,3 with alternating ±1, see [17]. Also, it is known 
that the Graver complexity of this bimatrix is g{A) = 9, see [21 [ISj- Therefore, the 
set Z{A) in (j4]) which corresponds to A, consists of all sums of at most 9 such circuit 
vectors, and turns out to be comprised of 42931 vectors in Z^, such as 

( 9 -2 -7 -4 5 -1 -5 -3 8 ). 

We now define a dynamic program, that is, a weighted digraph, which will enable 
to find a Graver-best step 7g for a feasible point x of ([I]) or detect that none exists. 

Definition 3.3 (the dynamic program) Let A be a fixed (r, s) xt bimatrix and 
let g{A) be its Graver complexity. Given n, w,b,l,u, feasible point x in ([1]), and 
positive integer 7, define a weighted digraph as follows. Its vertices are partitioned 
into n + 1 stages defined in terms of the fixed finite set Z{A) C Z* in (j4]), by 

So := {0} , S,:=S2:= ■■■ := := Z{A) , 5„ := {z G Z{A) : A,z = 0} . 

Denote the vertices of Si by h* G Z*. Introduce an arc (h*~^, h*) from h*^^ G Si^i to 
h* G Si if g* := h* - h*~^ G Z{A) and F < x* 7g* < u*, and give it weight w*g\ 
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To each dipath h = (h°, h^, . . . , h") from 5*0 to Sn in this digraph we associate 
a vector g(h) := (h^ - h°, . . . , h" - h"~^) G Z"*. Note that G Z{A) and hence the 
trivial path h = (0, . . . , 0) with weight and vector g(h) = (0, . . . , 0) always exists. 
Note also that wg(h) = Yl^=i w*(h* — h*~^) is precisely the weight of the dipath h. 

The following lemma relates this dynamic program to Graver augmentations. 

Lemma 3.4 A feasible step 7g for x which satisfies w(x + 7g) < w(x + 7g) for 
any feasible step 7g with g G ^ (A^"^) can be constructed in linear time 0{n). 

Proof. Let h be a minimum weight dipath from 5*0 to Sn and let g := g(h) be 
the vector associated with h. We claim that 7g is the desired feasible step for x. 

We begin with the complexity statement. Since A is fixed, so is g{A), and hence 
so is each Si. As the digraph is acyclic, the minimum weight dipath from 5*0 to 
h* G Si decomposes into a minimum weight dipath from 5*0 to some h*~^ G Si^i 
plus the arc from h*~ to h\ Thus, we have to check at most a constant number 
■ \Si\ of such pairs (h*~^,h*) to find the minimum weight dipaths from 5*0 to 
every h* G Si given the minimum weight dipaths from Sq to every h*~^ G Si^i. 
Repeating this for each of the sets 5*1, . . . , S'„ one after the other takes 0{n) time. 

We next show that 7g is a good feasible step. Since g* = h' — h'^"*^ and (h*~^, h*) 
is an arc, we have F < x* + 7g' < u*, and g* G Z{A) and hence A2g* = 0, for all i. 
Also, Y.l=i g' = h" G Sn and hence YJ'i=i ^ig' = ^ih" = 0. So x + 7g is feasible 
in ([1]). Moreover, wg = X]r=i^*g* weight of the minimum weight dipath h. 

Now consider any feasible step 7g for x with g G ^ (A*^")). Define h* := Xljxig"' 
all i. Then h* G Z{A) for all i by Lemma [3.11 Moreover, Aih" = Ai X]j=i g'^ = 0- 
Therefore G Si for all i. Furthermore, h* — h* ^ = g* G Z{A) and 1* < x*+7g* < u* 
and therefore (h* ^, h*) is an arc of weight w*g* for all i. So h = (h°, h^, . . . , h") is a 
dipath from 5*0 to 5*^ with weight wg = X]r=i'^*g* ^"^^ associated vector g(h) = g. 
Since h is a minimum weight dipath, wg < wg and so w(x + 7g) < w(x + 7g). □ 

Remark 3.5 (optimality certification in linear time) As noted in Section [21 
a feasible point x in integer program ([T]) is optimal if and only if there is no Graver 
augmenting step g G ^ (A'-"-*) for x. Thus, with 7 := 1, Lemma [3.41 implies that the 
optimality of a feasible point x in ([1]) can be determined in linear time 0{n). 

The next lemma shows that we can quickly find a Graver-best augmentation. 

Lemma 3.6 A feasible step 7g for x satisfying w(x + 7g) < w(x + 7g) for any 

feasible step 7g with 7 G and g G ^ (A*-'"^) can be found in quadratic time O(n^). 



Proof. If Z{A) = {0} then Q (A^")) = by Lemma [SH so 7g := will do. 
Otherwise, construct a set F of 0{n) positive integers in 0{n) time as follows: for 
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every i = 1, . . . ,n and every z G Z{A) \ {0} determine the largest positive integer 
7 such that V < x* + 7Z < and include it in F. Now, for each 7 G F, construct 
and solve the corresponding dynamic program, resulting in total of O(n^) time by 
Lemma 13.41 Let 7g be that feasible step for x which attains minimum value W7g 
among the best steps obtained from all these dynamic programs, and let 7g be that 
feasible step for x which attains minimum value W7g among 7g with g G ^ 
if any. Assume that W7g < as otherwise we are done since W7g < W7O = 0. Then 
7 is the largest positive integer such that 1 < x + 7g < u since otherwise the step 
(7 + l)g will be feasible and better. So for some i = 1, . . . ,n, it must be that 7 
is the largest positive integer such that F < x* + 7g* < u*. Since g G ^ (A*^"^), it 
follows from Lemma [3.11 that g* G Z{A). Therefore 7 G F. Now let 7g be the best 
step attained from the dynamic program of 7. Then W7g < W7g by choice of 7g 
and W7g < W7g by Lemma [3.41 Therefore w(x + 7g) < w(x + 7g) as claimed. □ 

We next show, following [TU], that repeatedly applying Graver-best augmenting 
steps, we can augment an initial feasible point for ([1]) to an optimal one efficiently. 

Lemma 3.7 For any fixed himatrix A there is an algorithm that, given n, w, b, 1, u, 
and feasible point x for (Qp, finds an optimal solution x* for (Qp in time 0{n^L). 

Proof. Iterate the following: find by the algorithm of Lemma [331 a Graver-best step 
7g for x; if it is augmenting then set x := x + 7g and repeat, else x* := x is optimal. 

To bound the number of iterations, following [10], note that while x is not op- 
timal, and X* is some optimal solution, we have that x* — x = Yli=i 7«Si ^ 
nonnegative integer combination of Graver basis elements gj G ^ (^A^^^^ all lying in 
the same orthant, and hence each x-t-7jgj is feasible in ([T]). Moreover, by the integer 
Caratheodory theorem of [3], [19], we can assume that k < 2{nt — 1). Letting 7jgj be 
a summand attaining minimum W7jgj, and letting 7g be a Graver-best augmenting 
step for X obtained from the algorithm of Lemma 13.61 we find that 

1 

w(x + 7g)-wx < w(x 7igi) - wx < ^ (wx* - wx) . 

So the Graver-best step provides an improvement which is a constant fraction of the 
best possible improvement, and this can be shown to lead to a bound of 0{nL) on 
the number of iterations to optimality, see [10] for more details. Since each iteration 
takes O(n^) time by Lemma [3. 6[ the overall running time is 0{n^L) as claimed. □ 

We next show how to find an initial feasible point for ([T]) with the same com- 
plexity. We follow the approach of [S] using a suitable auxiliary ra-fold program. 

Lemma 3.8 For any fixed himatrix A there is an algorithm that, given n, b, 1, u, 
either finds a feasible point x for ^ or asserts that none exists, in time 0{n^L). 
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Proof. Construct an auxiliary n-fold integer program 

min { wz : ^("^z = b,T<z<u,zG Z"(*+2r-+2.) | 
as follows. First, construct a new fixed (r, s) x (t + 2r + 2s) bimatrix A with 

Ai := ( Ai Ij. —Ir Orxs Orxs ) , A2 [ A2 Osxr ^sxr h ~Is ) • 

Now, the n{t + 2r + 2s) variables z have a natural partition into nt original variables 
X and n(2r+2s) new auxiliary variables y. Keep the original lower and upper bounds 
on the original variables and introduce lower bound and upper bound ||b||oo on 
each auxiliary variable. Let the new objective wz be the sum of auxiliary variables. 
Note that the binary length of the auxiliary program satisfies L = 0{L) and an 
initial feasible point z with x = for ([5]) with the original b is easy to construct. 
Now apply the algorithm of Lemma 13.71 and find in time 0{n^L) = 0{n^L) an 
optimal solution z for If the optimal objective value is then y = and x is 
feasible in the original program ([T]) whereas if it is positive then ([T]) is infeasible. □ 

We can now obtain the main result of this article. 

Theorem 3.9 For every fixed integer (r, s) x t bimatrix A, there is an algorithm 
that, given n, vectors w, 1, u G Z"* and b G 77'^^''^ having binary encoding length 
L := (w, b, 1, u, ), solves in time 0{n^L) the n-fold integer programming problem 

min {wx : ^("^x = b, l<x<u, xG Z'^*} . 

Proof. Use the algorithm of Lemma 13.81 to either detect infeasibility or obtain 
a feasible point and augment it by the algorithm of Lemma 13.71 to optimality. □ 

4 Extensions to nonlinear objectives 

Here we extend some of our results to programs with nonlinear objective functions, 
min {/(x) : ^("^x = b , 1 < x < u , x G Z"*} . (6) 

A function / : R"* — )■ M is separable convex if /(x) = Yll^=i /*(^*) = ^^=i Z]j=i fji^]) 
with each /j univariate convex. In it was shown that Graver bases provide opti- 
mality certificates for problem with separable convex functions as well: a feasible 
point X is optimal if and only if there is no feasible Graver step g for x which satisfies 
/(x + g) < /(x). This was used in [10] to provide polynomial time procedures for 
optimality certification and solution of problem ([6]) with separable convex functions 
/. However, this involved again checking each of the 0{n^^^^) elements of Q (A^"-)). 

Our results from Section [3] can be extended to provide linear time optimality 
certification for separable convex functions and a cubic time solution of ([6]) for 
separable convex piecewise affine functions. We discuss these respectively next. 
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Optimality certification for separable convex objectives 

Here we assume that the objective function / is presented by a comparison oracle 
that, queried on two vectors x, y, asserts whether or not /(x) < /(y). The time 
complexity now measures the number of arithmetic operations and oracle queries. 

Theorem 4.1 For any fixed bimatrix A, there is an algorithm that, given n, b, 1, u, 
separable convex f presented by comparison oracle, and feasible point x in program 
1^, either asserts that x is optimal or finds an augmenting step g for x which 
satisfies /(x + g) < /(x + g) for any feasible step g E G (A*^"^), in linear time 0{n). 

Proof. Given the feasible point x, set a dynamic program similar to that in Defini- 
tion l3.3t with 7 := 1, with the only modification that the weight of arc (h*~^, h*) from 
h*-^ G Si-i to G Si is now defined to be /*(x^ + g^) - /^(x^) with g^ := h* - h^~^ 
Then, for every dipath h and its associated vector g := g(h), we now have 

n 

/(x + g)-/(x) = ^(f(x^ + g^)-f(x^)) = weight of dipath h. 

i=l 

We now claim that the desired step is the vector g := g(h) associated with a 
minimum weight dipath h in this dynamic program. Indeed, an argument similar to 
that in the proof Lemma [3.41 now implies that for any feasible step g G ^ (A*^")) we 
have /(x + g) — /(x) < /(x + g) — /(x) and therefore /(x + g) < /(x + g). □ 

Optimization of separable convex piecewise aflflne objectives 

In [To] it was shown that problem ([6]) can be solved for any separable convex function 
in polynomial time, but with very large dependency of 0{n^^^'^) of the running 
time on n, with the exponent g{A) depending on the bimatrix A. Here we restrict 
attention to separable convex objective functions which are piecewise afiine, for 
which we are able to reduce the time dependency on n to 0{n^) independent of A. 

So we assume again that /(x) = ^"=i/*(x*) = Y^^=iY^j=i fji^]) with each 
/j : M — 7- M univariate convex. Moreover, we now also assume that for some fixed 
p, each /j is p-piecewise afiine, that is, the interval between the lower bound 1* 
and upper bound u* is partitioned into at most p intervals with integer end-points, 
and the restriction of /j to each interval k is an afiine function ^x* + a* f. with 
all a* integers. We denote by (/) the binary length of / which is the sum 
of binary lengths of all interval end-points and a* ^i. needed to describe it. The 
binary length of the input for the nonlinear problem ([6]) is now L := (/, b, 1, u). 

Theorem 4.2 For any fixed p and bimatrix A, there is an algorithm that, given n, 
b, 1, u, and separable convex p -piecewise affine f , solves in time 0{n^L) the program 

min{/(x) : ^("^x = b , l<x<u, x G Z"*} . 
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Proof. We need to establish analogs of some of the lemmas of Section [3] for 
such objective functions. First, for the analog of Lemma [331 proceed as in the proof 
of Theorem 14.11 above: given a feasible point x and positive integer 7, set again a 
dynamic program similar to that in Definition 13.31 with the weight of arc (h*~ , h') 
from h^~^ e Si-i to G Si defined to be /*(x^ + 7g^) - with := h' - h' \ 

A similar argument to that in the proof of Lemma 13.41 now shows that 7g with 
g := g(h) the vector associated with the minimum weight dipath h is a feasible step 
for X which satisfies /(x+7g) < /(x+7g) for any feasible step 7g with g E Q (A'-"''') . 

For the analog of Lemma 13. 6[ we construct again a set F of critical step sizes 
7 as follows. First, as in the proof of Lemma 13. 6^ we collect the critical step sizes 
due to the lower and upper bound constraints by finding, for every i = 1, . . . ,n and 
every z G Z{A) \ {0}, the largest positive integer 7 such that T < x* + 7Z < u*, 
and include it in F. However, in contrast to Lemma 13. 6[ we are now dealing with 
the more general class of piecewise affine objective functions /j. So we must add 
also the following values 7 to F: for every i = 1, . . . ,n, every z G Z{A) \ {0}, and 
every j = 1, . . . ,t, if • + 'jzj and + (7 + l)zj belong to different affine pieces 
of fj, then 7 is included in F. Since the number p of affine pieces is constant, the 
number of such values for each i, z and j is also constant. So the total number of 
elements of F remains linear and it can be constructed in linear time 0{n) again. 
Now we continue as in the proof of Lemma 13.61 for each 7 in F, using the analog 
of Lemma 13.41 established in the first paragraph above, we solve the corresponding 
dynamic program in 0{n) time, resulting in total of 0(?t,^) time again. Let 7g be 
that feasible step for x which attains minimum value /(x + 7g) among the best 
steps obtained from all these dynamic programs, and let 7g be that feasible step 
for X which attains minimum value /(x + 7g) among 7g with g G ^ (A*-"^) if 
any. It now follows from the construction of F that if 7g is augmenting, namely, if 
/(x + 7g) — /(x) < 0, then 7 G F, as otherwise the step (7 + l)g will be feasible 
and better. Let 7g be the best step attained from the dynamic program of 7. Then 

/(x + 7g) < /(x + 7g) < /(x + 7g) 

where the first inequality follows from the choice of 7g and the second inequality 
follows from the analog of Lemma 13.41 Therefore /(x + 7g) < /(x + 7g). 

For the analog of Lemma 13771 we use the results of [10] incorporating the optimal- 
ity criterion of [16], which assure that the number of iterations needed when using 
a Graver-best augmenting step at each iteration, is bounded by 0{n (/)) = 0{nL), 
resulting again in overall time complexity 0{n^L) for augmenting an initial feasible 
point to an optimal solution of ([6]). Since an initial feasible point if any can be found 
by Lemma 13.81 as before in the same complexity, the theorem now follows. □ 
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5 Some consequences 

Here we briefly discuss two of the many consequences of n-fold integer programming 
which, now with our new algorithm, can be solved drastically faster than before. 

Nonlinear multicommodity transportation 

The multicommodity transportation problem seeks minimum cost routing of / com- 
modities from m suppliers to n consumers subject to supply, consumption and ca- 
pacity constraints. For / = 1 this is the classical transportation problem which is 
efficiently solvable by linear programming. But already for / = 2 it is NP-hard. Here 
we consider the problem with fixed (but arbitrary) number / of commodities, fixed 
(but arbitrary) number m of suppliers, and variable number n of consumers. This 
is natural in typical applications where few facilities serve many customers. 

The data is as follows. Each supplier i has a supply vector s* G 1j\ with s\ 
its supply in commodity k. Each consumer j has a consumption vector (9 G llj^ 
with its consumption in commodity k. The amount of commodity k to be routed 
from supplier i to consumer j is an integer decision variable x^^.. The total amount 

Y^k=i k °f commodities routed on the channel from i to j should not exceed the 
channel capacity Wjj, and has cost fij ^XlL=i^iA:) suitable univariate functions 
fij. We can handle standard linear costs as well as more realistic, convex piecewise 
affine cost functions /j j, which account for channel congestion under heavy routing. 

As a corollary of Theorem 14.21 for any fixed numbers / of commodities and m of 
suppliers, the problem can be solved in time cubic in the number n of consumers. 

Corollary 5.1 For every fixed I commodities, m suppliers, and p, there exists an 
algorithm that, given n consumers, supplies and demands s\c^ G capacities 
Uij G and convex p -piecewise affine costs fij : Z — )■ Z, solves in time 0{n^L), 
with L := {s\c^ ,Uij, fij) , the integer multicommodity transportation problem 

{m n / I \ I 

i=l j=l \k=l ) j i k=l 

Proof. Introduce new variables yj and equations yj = X]fc=i ^ik ^'i- Then 

the objective function becomes Ylii j fid^vl) '^hich is separable convex p-piecewise 
affine in the new variables, and the capacity constraints become yl < Uij which are 
upper bounds on the new variables. Use Uij as an upper bound on ^ and as a 
trivial lower bound on yl for all i,j, k. As shown in [11], arranging the original and 
new variables in a tuple z = (z^, . . . , z") of n bricks z^ G Z™*^'"*"^^ defined by 
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this problem can be modeled as an ra-fold program, resulting in solution in large time 
O (^n^^^^Lj, with exponent depending on l,m. Theorem 14.21 now enables solution in 
cubic time independent of the numbers / of commodities and m of suppliers. □ 

Privacy in statistical databases 

A common practice in the disclosure of sensitive data contained in a multiway table 
is to release some of the table margins rather than the entries of the table. Once the 
margins are released, the security of any specific entry of the table is related to the set 
of possible values that can occur in that entry in all tables having the same margins 
as those of the source table in the database, see [71 [20] and the references therein. In 
particular, if this set is small or consists of a unique value, that of the source table, 
then this entry can be exposed. Thus, it is desirable to compute the minimum and 
maximum integer values that can occur in an entry, which in particular are equal if 
and only if the entry value is unique, before margin disclosure is enabled. 

Consider {d+ l)-way tables of format mo x ■ ■ ■ x nid, that is, arrays v = (t'jo,...,i^) 
indexed by 1 < ij < rrij for all j, with all entries nonnegative integers. Our 

results hold for arbitrary hierarchical margins, but for simplicity we restrict attention 
to disclosure of (i-margins, that is, the d+1 many d-waj tables ('yjo,,,.^j 
obtained from v by collapsing one factor < j < d at a time, with entries given by 

mj 

Vio,...,ij.l,*,ij+l,...,ia ■= ''^^'Vio,---,ij-i,ij,ij+l,---,id ^ 1 < "^fc < ''^fc , 0<k<d, k^j . 
ij=l 

The problem is then to compute the minimum and maximum integer values that 
can occur in an entry subject to the margins of the source table in the database. 

This problem is NP-hard already for 3-way tables of format n x m x 3, see [Hj- 
However, as a corollary of Theorem 13. 9[ if only one side n of the table is variable, 
the problem can be solved in cubic time regardless of the other sides as follows. 

Corollary 5.2 For every fixed d,mi, . . . ^m^, there is an algorithm that, given n, 
integer d-margins . . . , (wjo,...,irf_i,*), and index {ko, . . . , kd), determines, in 

time 0{n^L), with L the binary length of the given margins, the minimum and 
maximum values of entry XkQ,...,ka o,mong all tables with these margins, that is, solves 

mm/ max |a;fco,...,fc^ : a; fc ^+ , l,Xjo,...,i^._j,*,i^.^^,...,j^j — l^f jo,...,ij_i,*,ij+i,...,jdJ "Jj • 

Proof. Let u be the maximum value of any entry in the given margins and use it 
as an upper bound on every variable. As shown in [5], this problem can be modeled 
as an n-fold integer programming problem, resulting in solution in large running 
time O {n^^^^Lj, with exponent which depends on mi, . . . ,md. Theorem 13.91 now 
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enables to solve it in cubic time independent of the table dimensions mi, ... , m^. □ 

We note that long tables, with one side much larger than the others, often arise 
in practical applications. For instance, in health statistical tables, the long factor 
may be the age of an individual, whereas other factors may be binary (yes-no) 
or ternary (subnormal, normal, and supnormal). Moreover, it is always possible 
to merge categories of factors, with the resulting coarser tables approximating the 
original ones, making the algorithm of Corollary 15.21 applicable. 

We also note that, by repeatedly incrementing a lower bound and decrementing 
an upper bound on the entry Xko,...,kaJ computing its new minimum and maxi- 
mum values subject to these bounds, we can produce the entire set of values that 
can occur in that entry in time proportional to the number of such values. 

6 Solvability over bimatrices with variable entries 

The drop of the Graver complexity from the exponent of n to the constant multiple 
also leads to the first polynomial time solution of ra-fold integer programming with 
variable bimatrices. Of course, by the universality of ra-fold integer programming, the 
variability of the bimatrices must be limited. In what follows, we fix the dimensions 
r, s, t of the input bimatrix A, and let the entries vary. We show that, given as part 
of the input an upper bound a on the absolute value of every entry of A, we can 
solve the problem in time polynomial in a, that is, polynomial in the unary length 
of a. This holds for linear as well as separable convex piecewise affine objectives. 

We have the following theorem, with L := (/, a, b, 1, u) the length of the input. 

Theorem 6.1 For any fixed r,s,t and p, there is an algorithm that, given n, a, 
(r, s) X t bimatrix A with all entries bounded by a in absolute value, b, 1, u, and 
separable convex p-piecewise affine f, in polynomial time 0{a^*^^^^^^~^^^^^n^L), solves 

min{/(x) : A^^^x = b , l<x<u, x G Z"*} . 

Proof. Let Q{A2) be the Graver basis of the s x t second block A2 of A, let 
p := 1^(^42) I be its cardinality, and arrange its elements as the columns oi a. t x p 
matrix G2- Since r,s,t are fixed, it follows from bounds on Graver bases (see e.g. 
[T71 Section 3.4]) that every g G ^(^2) satisfies llsfHoo = 0(a*) and hence p = 0(a**). 

Now, it is known (see [I3l [18] or [T71 Section 4.1]) that the Graver complexity 
g{A) of A is equal to the maximum value of any element v in the Graver basis 
Q {A1G2) of the rxp matrix AiG2- Since the entries of A1G2 are bounded in absolute 
value by 0(0"*+^), the bounds on Graver bases (see again [171 Section 3.4]) imply 
that = 0{p ■ (a'+^Y) for every v G ^(^1^2) and hence g{A) = 0(a"+'**+''). 
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Now, consider again the following set defined in (jlj) in Lemma 13. H 



Z{A) := {z G Z* : z is the sum of at most g{A) elements of ^(^2)} . 



For each z G Z{A) we have that H^Hoo is bounded by g{A) times the maximum value 
of ||c/||oo over all g G ^^(^2), and therefore \\z\\^ = 0{g{A) ■ a') = 0(a^'^+^*+''+^). So 
the cardinality of Z{A) satisfies \Z{A) \ = 0((a"+^*+'^+^)*) = 0(a*('^"+"*+'"+")). 

Now, suitable analogs of some of the lemmas of Section |3]go through, except that 
the complexities now depend on the variable size of the set Z{A), as follows. The 
time complexity of the algorithm of Lemma [3.41 becomes 0{\Z{A)\'^n). The size of 
the set r of critical values is now 0{\Z{A)\n) and therefore the time complexity of 
the algorithm of Lemma [3.61 now becomes 0{\Z{A)\^n'^). The number of iterations 
needed to augment an initial feasible point to an optimal solution remains 0{nL) 
as before and therefore the time complexity of the algorithm of Lemma 13.71 now 
become 0{\Z{A)\^n^L). To find an initial feasible point, one can use the algorithm 
of Lemma 13.81 but then t would have to be replaced by t + 2r + 2s for the auxiliary 
bimatrix A, resulting in a somewhat larger exponent for a in the running time. 
However, it is possible to find an initial feasible point in an alternative, somewhat 
more involved way, keeping the original system with the bimatrix A, as follows. 
First find an integer solution to the system of equations only (without the lower and 
upper bounds) using the Hermite normal forms of the blocks Ai and A2. Second, 
relax the bounds so as to make that point feasible. Third, minimize the following 
auxiliary objective function which is separable convex 3-piecewise affine, with 



If the optimal value is zero then the optimal auxiliary solution is feasible in the 
original problem, whereas if it is positive then the original problem is infeasible. 
Since this minimization can be done using the separable convex piecewise affine 
analog of Lemma 13.71 described in the proof of Theorem 14.21 in the same complexity 
0{\Z(A)\'^n^L), the overall running time is 0{a^^^'^^~^^^~^'^~^^^n^L) as claimed. □ 



7 Parametrization and approximation hierarchy 

We conclude with a short discussion of the universality of n-fold integer programming 
and the resulting parametrization and simple approximation hierarchy for all of 
integer programming. As mentioned in the introduction, every integer program is an 
n-fold program for some m over the bimatrix A{m) having first block the identity 
matrix I^^ and second block the (3 + m) x 3m incidence matrix of K^ jji- It is 
convenient and illuminating to introduce also the following description. 
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Consider the following special form of the ra-fold product operator. For an s x t 
matrix D, let -0^"' := A*-'"^ where A is the {t,s) x t bimatrix A with first block 
Ai := If the t x t identity matrix and second block A2 := D. We consider such 
m-fold products of the 1x3 matrix (1 1 1). Note that (1 1 1)^™] is precisely the 
(3 + m) X 3m incidence matrix of the complete bipartite graph K^^m- For instance, 



The following theorem was established in [B]. 

The Universality Theorem [6] Every (hounded) integer programming problem 
min{cy : y G Z^, Vy = v} is polynomial time equivalent to some integer program 



min {wx : x G Z^^!"", (1 1 l)HMx = b} = min {wx : x G Z^^!"", A(m)(")x = b} . 



This theorem provides a new, variable dimension, parametrization of integer 
programming: for each fixed value of the parameter m, the resulting programs above 
with n variable live in variable dimension 3mn and include natural models such as 
those described in Section [5l and can be solved in cubic time 0{n^L) by Theorem 
I3.9j and when the parameter m varies, every integer program appears for some m. 

Our new algorithm suggests a natural simple approximation hierarchy for integer 
programming, parameterized by degree d, as follows. Fix any d. Then given any m, 
let A := A{m), so t = 3m, Ai = I^^n, and A2 is the incidence matrix of K^ ^- Define 
the approximation at degree d of the set Z{A) in equation (jl]) in Lemma [3.11 by 

Z(i{m) := {z G Z'^™ : z is the sum of at most d elements of Q{A2)^ . (7) 

Since A2 is totally unimodular, Q{A2) consists of the O(m^) vectors in {0, ±1}^"* 
supported on circuits of K^.^ with alternating ±1 and hence |Zrf(m)| = 0{m?'^). 

Now, given a feasible point x in the universal program above, and positive integer 
7, set a dynamic program similar to that in Definition l3.3[ with the only modification 
that the sets Si are defined using the approximation Z^^m) of Z{A). Weaker forms 
of Lemmas 13.41 and 13.61 now assert that in time 0{\Zii{rn)\^n'^) = 0{mP'^n'^), which 
is polynomial in both m and n, we can find a good feasible step 7g. We use this 
iteratively to augment an initial feasible point to one which is as good as possible and 
output it. However, Lemma [3.11 no longer holds and not all brick sums of elements 
of the Graver basis Q{A^"'^) lie in Z^im). So the bounds on the number of iterations 
and total running time are no longer valid and the output point may be non optimal. 
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By increasing the degree d we can get better approximations at increasing run- 
ning times, and when d = g{A) we get the true optimal solution. These approxima- 
tions are currently under study, implementation and testing. They show promising 
behavior already at degree d = 3 and will be discussed in more detail elsewhere. 

For m = 3, discussed in Example 13. 2[ for which the universal problem is equiva- 
lent to optimization over 3- way n x 3 x 3 tables, the approximation Z^{3) at degree 
d = 3 contains only 811 vectors out of the 42931 vectors in the true Z{A), such as 

(-3212-3111-2). 
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